The WWW Based Data Mining Toolbox Architecture
نویسندگان
چکیده
This paper presents the Data Mining (DM) toolbox architecture based on cutting edge World Wide Web (WWW) technologies. The DM toolbox is used to discover new and useful knowledge by integrating results generated by multiple DM tools. The proposed architecture allows submission of data to the DM toolbox and generation of results that combine knowledge generated by several different DM tools. The DM toolbox dynamically finds DM tools that are relevant to a specific data mining task, submits the data to the tools, receives results of their analysis, and combines the results to generate a final report. The proposed architecture will increase the usability of DM tools, helping achieve a more consistent and better integrated Data Mining and Knowledge Discovery (DMKD) process.
منابع مشابه
A Robust System Architecture for Mining Semi-Structured Data
The value of extracting knowledge from semi-structured data is readily apparent with the explosion of the WWW and the advent of digital libraries. This paper proposes a versatile system architecture for text mining that maintains structured data components in a relational database and unstructured concepts in a concept library. After a detailed explanation of our system architecture, we briefly...
متن کاملImplementing Service Oriented Architecture for Data Mining
With Web technology, data on internet has become increasingly large and complex. No matter users or internet users needs all this data. Also the data which is available on web not all the time useful information or it is knowledgeable. Hence web data mining is necessary to fulfill this demand. Web data mining can extract unstructured, undiscovered data which is possibly useful information and k...
متن کاملA Toolbox Approach to Flexible and Efficient Data Mining
We are describing an approach to data mining with a toolbox based on Python scripts, which allows tackling common tasks occurring in data mining in a flexible and efficient way. Using either a relational database, text or binary files the toolbox gives the user a uniform view of the data collection. Two core features of the toolbox are caching of database queries and parallelism within a collec...
متن کاملAgent Based Framework for Semantic Web Content Mining
With flooding of information on WWW it has become necessary to apply some strategy so that valuable knowledge can be extracted and consequently returned to the user. Data mining techniques find their applicability in such scenario. Data mining concepts and techniques when applied to WWW with its existing technologies are termed as web mining. Web mining can change the way results are provided t...
متن کاملAdaptive Neuro-Fuzzy Inference System application for hydrothermal alteration mapping using ASTER data
The main problem associated with the traditional approach to image classification for the mapping of hydrothermal alteration is that materials not associated with hydrothermal alteration may be erroneously classified as hydrothermally altered due to the similar spectral properties of altered and unaltered minerals. The major objective of this paper is to investigate the potential of a neuro-fuz...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002